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Abstract 

The reconstruction of the parameter of the model by the measurement 
of the random variable depending on this parameter is one of the main 
tasks of statistics. In the paper the notion of the statistically dual 
distributions is introduced. The approach, based on the properties 
of the statistically dual distributions, to resolving of the given task is 
proposed. 
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1 Introduction 

As shown in refs. (Bityukov 2000, Bityukov 2002, Bityukov 2003), in frame 
of frequentist approach we can construct the probability distribution of the 
possible magnitudes of the Poisson distribution parameter to give the ob- 
served number of events h in Poisson stream of events. This distribution is 
described as Gamma-distribution with the probability density, which 

looks like Poisson distribution of probabilities, namely, 

gM = -q-e - ", n > 0, h > -1, (1) 



where /i is a variable and h is a parameter (in case of Poisson distributions 
/j, is a parameter and n is a variable). It means, as shown below, that we can 
estimate the value and error of Poisson distribution parameter by the mea- 
surement of mean value of the random variable of this Poisson distribution 
and by using the correspondent Gamma-distribution. 

Let us name such distributions, which allow to exchange the parameter 
and the variable, conserving the same formula for the distribution of proba- 
bilities, as statistically dual distributions. 

Note, it is pure probabilistic (and, in this sense, frequentist) definition. 
As shown below, the properties of considered in this paper of two pairs of 
statistically dual distributions coincide with properties of conjugate families 
(which defined in frame of Bayesian Theory (for example, Bernardo, 1994)). 

In the next Section we show that Poisson and Gamma- distributions are 
statistically dual distributions and that normal distribution with constant 
variance is statistically self-dual distribution. Despite little differences in 
formulae, we also point on the interdependency between the negative bi- 
nomial and (3— distributions, which allows to consider these distributions 
as statistically quasi-dual distributions. The interrelation of the statistical 
duality and the estimation of the distribution parameters are discussed in 
Section 3. 



Let t(x, y) be a function of two variables. If the same function can be rep- 
resented both as the distribution of the random variable x with parameter 
y and as the distribution of the random variable y with parameter x, then 
these distributions can be named as statistically dual distributions. 

The statistical duality of Poisson and Gamma-distributions follows 
from simple discourse. 

Let us consider the Gamma-distribution with probability density 



At change of standard designations of Gamma-distribution — , a and x 

P 

for a, n + 1 and [i we get the following formula for probability density of 
Gamma-distribution 



2 Statistically dual distributions 




X 



(2) 



1 



2 



„n+l 

9n(a,n) = ^— — e-*vr, (3) 

where a is a scale parameter and n + 1 > is a shape parameter. Suppose 
a — 1, then the formula of the probability density of Gamma-distribution 
i\ n+ i is (here we repeat Eq.l) 

a n 

9n{n) = -r e ~^ A* > °> w > -1. 
n! 

It is usually supposed that the probability of observing n events in the 
experiment is described by Poisson distribution with parameter fi, i.e. 

/(n;/i) = ^e-' t , /x > 0, n > 0. (4) 

One can see that the parameter and variable in Eq.l and Eq.4 are ex- 
changed in other respects the formulae are identical. As a result these distri- 
butions (Gamma and Poisson) are statistically dual distributions. These 
distributions (Bityukov 2002) are connected by identity (see, also, identities 
in refs. (Jaynes 1976, Frodesen 1979, Cousins 1995)) 



rfl2 TO rfll m 

/ g m (^)d^+ /(«; A*2) +/ 9n{^)dn- Yl ^i) = > ( 5 ) 

J ^ i=n+l i=n+l 

for any real fj,i > 0, yU 2 > 0, and integer m > n > 0. 

The other example of statistically dual distribution is the normal distri- 
bution with mean value a and constant variance a 2 : 

1 (x-a) 2 

<p(x;a,a) = e , (6) 

V2"7rcr 



where x is real variable, a and a > are real parameters. Here we also can 
exchange the parameter a and variable x saving the formulized description 
of the probability density. It allows to estimate the value of parameter a by 
measurement of mean value for x. In this case we must consider new density 
of probability with the real variable a and real parameters x and a > 

1 (x-a) 2 

(j)(a;x,a) = — =^e . (7) 

V27T<7 
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Such a way the normal distribution can be named as statistical self-dual 
distribution. The analogous identity (as Eq.5) for given distributions takes 
place 



rd rd rc rc 

j <p(a;c,cr)da+ / (p(x; d, a)dx + / <p(a;d,a)da+ / (p(x; c, a)dx — 

Jc Jc Jd Jd 

or, simpler, 

rd rd 

J (j)(a;b,cr)da — J <p(x; b, a)dx = (8) 

for any real b, c and d. 

Let us present (Eadie 1971) the probability density of (3— distribution as 

P(x;n,m)= ^ + ™ | ^' ^(l - x) m , (9) 
n'.m'. 

with real variable x (0 < x < 1) and integer non negative parameters n and 
m, and the probability distribution of negative binomial distribution as 

P(k;n,p)= { ^±^p n+1 (l-p) k , (10) 

where k is integer non negative variable, integer n and real p are parameters 
(0 < p < 1). 

The formulae, which describe these distributions, are different, neverthe- 
less, as is mentioned in ref. (Jaynes 1976), the negative binomial (Pascal) 
and (3— distributions have interrelation in the form of identity 

/ (3(x; n, m)dx — V" P(k; n, p) = 0. (11) 

Correspondingly, these distributions can be named as statistically quasi- 
dual distributions. 

3 Statistical duality and estimation of the pa- 
rameter of distribution 

The identity (Eq.5) can be written in form (Bityukov 2000, Bityukov 2003) 
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E f(i\ + / 9Md» + E /(<; #0 = i, (12) 

i=n+l w i=0 

i.e. 

i=n+l 4 - i=0 J - 

for any real [i\ > and ^2 > and non negative integer n. 

The definition of the confidence interval (to> to) for Poisson distribution 
parameter /x as (Bityukov 2000) 

P(to < V < to) = P(i < n\to) - P(i < n\fi 2 ), (13) 

where P(i < n\fj) = E — n — 5 allows to show that a Gamma-distribution 

i=0 f - 

Ti.i+n is the probability distribution of different values of ji parameter of 
Poisson distribution on condition that the observed value of the number of 
events is equal to n. This definition is consistent with identity Eq.12 in con- 
trast with another frequentist definitions of confidence interval (for example, 
if we suppose in Eq.12 that /ii = /i 2 we have conservation of probability). 
The right part of Eq.13 determines the frequentist sense of this definition. 

Let us suppose that <fa(/z) is the probability density of parameter of the 
Poisson distribution if number of observed events is equal to n. It is a condi- 
tional probability density. As it follows from formulae (Eqs.1,12), the <fa(/i) 
is the density of Gamma-distribution by definition. Note, that the definition 
of conjugate families 1 is based on using of probability density of parameter 
distribution (Casella, 2001). 

On the other hand: if gn(fJ-) is n °t equal to this probability density and 
the probability density of the Poisson parameter is the other function h(fj,; h) 
then there takes place another identity 

00 f(ta h 

E /(»; to) + / Kto h)dv + E /(»; to) = i. (14) 

i=h+l m i=0 

1 Given a family T of pdf's f(x\8) indexed by a parameter 0, then a family, II of prior 
distributions is said to be conjugate for the family T if the posterior distribution of 8 is in 
the family II for all / e T, all priors ir(6) € H and all possible data sets x 
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This identity is correct for any real [i\ > and fi 2 > too. The sums 
in the left part of this equation determine the boundary conditions on the 
confidence interval. 

If we subtract Eq.14 from Eq.12 then we have 



We can choose the Hi and \ii by the arbitrary way. Let us make this choice 
so that gn(fj) is not equal h(^i;h) in the interval (/ii,/i 2 ) and, for example, 
gn(fJ-) > h(fx; h) and /x 2 > In this case we have 



and we have contradiction (i.e. gh{^) = h(/i;n) everywhere except, may be, 
a finite set of points). As a result we can mix Bayesian (</n(/j)) and frequen- 
tist (f(k;fi)) probabilities without any logical inconsistencies. The identity 
(Eq.12) leaves no place for any prior except uniform (7r(/i) = const 2 ). As 
a result, we can construct the distribution of the errors in the estimation 
of the parameter of Poisson distribution by single measurement of a casual 
variable and, correspondingly, the confidence intervals, estimate the param- 
eter by several measurements, take into account systematics and statistical 
uncertainties of measurements at statistical conclusions about the quality of 
planned experiments (Bityukov 2000, Bityukov 2003, Bityukov 2004). 

For the normal distribution the identity (Eq.8) can be written in case of 
the observed value of random variable x as 

/x— c rx+d roo 

ip(x; x, a)dx + / <p(a; x, a)da + / ip(x; x, a)dx = 1 (17) 
-oo Jx—c Jx+d 

for any real c and d or, in analogy with Eq.12 for the case of Poisson-Gamma 
distributions, 

rco rx+d r-x 

\ (p(x;x — c,cr)dx+ / (p(a;x,a)da+ / (f(x; x + d, a)dx = 1 (18) 

Jx JX—C J —CO 




(15) 




(16) 



2 Bayesian methods are supposed that gh{^) = 
prior probability density for fi. 



f(n;n) -7r(/i) 



where tt(^l) is the 



/ f(h;fj,) -7r(/i)d/i 
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for any real c > and d > 0. 

These identities (18,19) also allow to say that conditional distribution (if 
observed value is x) of true value of the parameter a obeys normal distribution 
with mean value x and constant variance a 2 (i.e. here, in contrast with 
previous example, x is the unbiased estimator of the parameter a). As a result 
we can construct the distribution of the error and the confidence intervals of 
parameter a, take into account systematics and statistical uncertainties and 
so on in accordance with standard analysis of errors (Eadie 1971). 

So, the statistical duality allows to connect the estimation of the param- 
eter with the measurement of the random variable of the distribution under 
study. 

4 Conclusion 

In the paper the notion of the statistically dual distributions is introduced. 
The relation between the measurement of casual variable and the estima- 
tion of the given distribution parameter for two pairs of statistically dual 
distributions is presented. The proposed approach allows to construct the 
distribution of measurement error of the estimator of distribution parameter 
by the using of statistically dual distribution. 

Both considered cases of statistically dual distributions (which are intro- 
duced in frame of frequentist approach), namely Poisson distribution versus 
Gamma distribution and Normal distribution versus Normal distribution, 
also belong to conjugate families (which are defined in frame of Bayesian 
approach). For example (Bityukov 2003 AC AT), the distributions conju- 
gated to Poisson distributions were built by Monte Carlo method (i.e. in 
frame of frequentist approach). The hypotheses testing confirms that these 
distributions are Gamma-distributions as expected in this case. 
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